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(57) Abstract 

Provided is a method for categorising nucleic acid, which method comprises producing a nucleic acid population by action of an 
endonuciease on double-stranded nucleic acid, such that each nucleic acid in the nucleic acid population has a double-stranded portion, 
contacting the nucleic acid population with one or more oligonucleotide sequences, and isolating nucleic acid which correctly hybridises 
to an oligonucleotide sequence, wherein each oligonucleotide sequence has a pre-determined recognition sequence, the nucleic acid being 
categorised by its ability to correctly hybridise to oligonucleotide sequences having the recognition sequence, the recognition sequence being 
situated such that it recognises a sequence in the double-stranded portion of the nucleic acid, one or more different recognition sequences 
being represented in the oligonucleotide sequences. 
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The present invenrion concerns a metliod for categorising nucleic acid. In particular, the 
invention concerns a mediod for sortii:^ nucleic acid, which method permits reduaion in die 
complexity of a nucleic acid populadon of approximately one order of magnitude, or more. 
The invention also lelates to a kit for carrying out the above method. 

Analysis of nucleic acids is fuudaraenial to much of modem molecular biology. A particular 
feature of nucleic acids derived from living organism is thai they are ahxiost invariably 
complex populadons of sequences present in widely varying quantiues. In order to 
characterise these populations of nucleic acids it is xisual to anempi to reduce the complexity 
of the population of nucleic acids in some way. Traditionally die approach has been to clone 
complex nucleic acid molecules into vectors to allow them to be isolated and either sub- 
cloned further or analysed direcdy. Cloning requires the use of biological hosts and these are 
often difficult to use and require a great deal of specialist knowledge for the cloning 
procedures to be successful. The tradidonal processes of cloning to generate libraries of 
sequences are also only partially automatable. 

A problem which cloning does not address is how to isolate sequences which are present only 
at low copies in backgrounds of sequences present at high copy numbers. Various techniques 
have been developed to ^normalise^ complex nucleic acid populadons prior to cloning in order 
to increase the quantities of sequences at low copy numbers relative to those at higji copy 
numbers. Subtractive hybridisation methods have been used to tiy and normalise cDNA 
populations. 

PCT/GB93/0 1452 describes methods of molecular sorting v^iiich uses restriction endonucleases 
that generate ambiguous sticky-ends in die nudrtc acid sample to be sorted. Adapters are 
designed widi sticky ends complemMtaxy to a single sticky-end sequence or a subset of die 
these ambiguous sticky ends such that the individual sticky end or subset thereof is coupled to a 
distinct sequence in the double stranded region of the adapter. This allows subsets of the 
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adaptored nucleic acid to be amplified using specific primers corresponding to sequences 
within the adapter which in turn relate to the sequence of the sticky end of the adapter, US 
patent 5,508,169 (issued November 7, 1995) describes methods very similar to those disclosed 
inPCT/Cfi93/01452, 

A problem with the above method is that the nucleic acids can be sorted only according co the 
sequence present on the sticicy-^nds of the nucleic acid. The sticky-end sequence is of limited 
length, as determined by die choice of restriction enzyme, thus the basis for sorting is 
limited. 

It is an object of the present invention to provide a method which overcomes the above 
problems, and provides a wider basis on which sordng of nucleic acid populations can be 
carried out, not limited by the sticky-end sequence. It is also an object of this invention to 
provide methods to reduce the complejdiy of nucleic acid populations by allowing them to be 
sorted into sub-populations without cloning and to permit normalisation of these populations. 
This invention describes methods of sorting nucleic acid molecules that have a variety of 
applications including gene expression profiling, preparation of templates for sequencings 
linkage analysis, etc. This invention provides methods of generating soned libraries. In many 
applications it is preferable that these sorted nucleic acids be captured on a solid phase supporc. 

Accordingly, the present invention provides a method for categorising nucleic acid, which 
method comprises producing a nucleic acid population by action of an endonuclease ou 
double-stranded nucleic acid, such that each nucleic acid in the nucleic acid population has a 
double-stranded portion, contacting the nucleic .acid population with one or more 
oligonucleotide sequences/ arid isolatiug nucleic acid which correctly hybridises to an 
oligonucleotide sequence; wherein each oligonucleotide sequence has a pre-determined 
recognition sequence, Che nucleic acid being categorised by its ability to correctiy hybridise to 
oligonucleotide sequences having ±e recognition sequence, the recognition sequence being 
situated such that it recognises a sequence in tiie double-stranded portion of the nucleic acid. 
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one or more differmc recognition sequences being represented in che oligonucleotide 
sequences. 

The presenc invention also provides kic for categorising a nucleic acid/ comprising one or 
more adaptors and one or more sets of oligonucleotide sequences/ wherein, the adapcors 
comprise nucleic acid having a double-stranded primer portion of a known sequence and a 
single-stranded portion of a pre-detennined length, either each single-stranded portion of each 
nucleic acid in the adaptors having the same pre-detennined sequence or all possible 
sequences of the single-stranded portion being represented In the adaptors, and wherein each 
oligonucleotide sequence comprises a first sequence, a second sequence attached to the first 
sequence and a third sequence attached to the second sequence, in which the first sequence is 
complementary to the sequence of the primer portion of the adaptor, the second sequence is 
ilie same sequence as the single-stranded portion of the adaptors or all possible second 
sequences of Che same lengdi as the single -snranded ponion of the adaptors are repjresented 
within the set of oligonucleotides, and the third sequence comprises a pre-determined 
recognition sequence. 

The invention will now be described in further detail by way of example only, with reference 
to the accompanying drawings, in which: 

Figure 1 shows a schematic of the treatment of a genomic DNA clone with a firequent cutting 
restriction endonuclease, such as Sau3AI, followed by ligation of adaptors to restriction 
fragments bearing specific primer sequences - all fragments are dealt with simultaneously, but 
for simplicity only one is shown; 

Figure 2 shows a schematic of an amplification step, following the steps of Figure 1, in 
which ft^gments are amplified by PCR using adaptor primers; 

Figure 3 shows a step following the step of Figure 2, in which amplified Segments are 
subdivided into 10, wells, each well being identified by a pair of primers used to son added 
molecules, each well initially containing one of Che pair of primers, there being 4 primers 
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each with one base probe sequence and each well having 1 of lO possible pairs generated by a 
combination of the four primers, the second primer being added after one cycle of synthesis 
of the first; 

Figure 4 shows a schematic of a differential amplification step* following the step of Figure 
3, in which the contents of a well containing a primer teraiinated with AC followed by a 
probe terminated by AG is amplified and then one cycle of synthesis is performed with the 
first primer and double strands captured widi avidinated beads; 

Fi|[Ure 5 shows a schematic of steps subsequent to those of Figure 4, in which die non- 
immobilised strand is melted off and washed away and the reaction residue polymerised, a 
second primer then being added and a second cycle of synthesis performed; and 
Figures 6 A and 6B show a schematic of steps subsequent to those of Figure 5, in which the 
non-immobilised strand is melted off and transferred to a fresh reaction vessel, and both 
primers are then added to the fresh free strand to amplify by PGR. 

In the present invention, the nucleic acid population is not isolated (such as by capture onto a 
solid phase) prior to contacting it with the oligonucleocide sequence(s). Thus each nucleic 
acid in the population may initially move freely in the suspension or solution in which it is 
contained. After contacting the nucleic acid population with the oligonucleotide sequenceCs). 
preferably only the nucleic acid(s) which have correctly hybridised to the oligonucleotide 
sequence(s) are isolated (preferably by capmre onto a solid phase). 

In more derail/the method of this invention may comprise the folio 

1. Restricting a large nucleic acid or population of large nucleic acids to generate fragments 
with known termioi. 

2. Ligating adaptors or linkers to the termini of these nucleic acid molecules. The ligated 
adaptor provides a known sequence at the termini of a population of nucleic acids which can be 
used to design primers which extend beyond the terminal adaptor sequence into unknown 
sequence adjacent to the known adaptor sequence allowing the unknown sequence to be 
probed. 
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3. Optionally amplifying the adaptored fragments using primers complementwy to the whole 

or part of the adaptor sequences at the teraiiii of the adapt^^^ 

4/ Optionally uormalismg the population of adaptored nucleic aci^^ 

5. Selectively amplifying subsets of the nucleic acids through the use of pairs of primers* 
which partially overlap into the unknown sequence. The overlapping primer will hybridise to a 
subset of xhe whole population. The size of the subset is determined by the length of overlap of 
the primer into the adjacent sequence. 

The methods of this invention may be applied cyclically to sub-populanons of soned nucleic 
acids generated by the methods of this invention. Each cycle funher reduces the complexity of 
the population. If necessary the cycles can be repeated vnd I unique nucleic acid is obtained. 

In a preferred embodiment the step of restricting nucleic acid is coupled to the ligation of 
adapters. Preferred restriction endonucleases for use with this invention cleave within their 
recognitidn sequence geaeraimg sticky-ends that do not encompass the whole recognition 
sequence. This allows ad^ters to he designed that bear sticky ends complementary to those 
generated by the prefenred restriction endonuclease but which do not regenerate the recognition 
site of the preferred restriction endonuclease. This means that if the restriction reaction is 
performed in the presence of ligase and adapters, the ligation of restriction fragments to each 
other is reduced by continuous cleavage of these ligatious whereas Kgation of adapters is 
irreversible so the presence of adapters drives the restriction to completion sind similarly the 
restriction endonuclease drives the ligation reaction to completion. This process ensures that a 
very high proportion of restriction fragments are ligated to adaptors/ This is advantageous as 
ligation of adapters to restriction fragments is a relatively inefficient process. This is due to 
random ligation of restriction products to each other if tiiese are phosphorylated. In this 
embo<Ement the adapters used are preferably not phosphorylated at their 5' hydroxyl groups so 
that they cannot ligate to themselves. 



• 
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GB 91 1 5407.0 describes a method of nonnalisiag a populatiori of nucleic acids comprising the 
following steps: 

1. Combining a mixture of heterogeneous DNA fiagments with oligonucleotide primers 
compatible with some nucleic acid amplification sysitm and denaturing the double stranded 
heterogeneous DNA, 

2. Altering the conditions, i-e* reducing the temperarure, to allow die more common species to 
re-anneal while preventing the primers from annealing to the DNA. The temperature for re- 
annealing at this stage must be higher than the melnng temperature of the PGR primers. 

3. Altering the reaction conditions furcher to allow the PGR primers to anneal to the remaining 
single stranded DNA which should represent the rarer species. 

4; Performing strand extension of the primed species, 

Advantageotisly/the above steps are applied cyclically a number of times to amplifir the rarer 
species to a significant extent. 

Application of this method to sequences with known termini permits the design of primers with 
very specific melting temperamres allowing the method to be used genericaliy. Use of this 
method is pairicularly advantageous in reducing the complexity of genomic DNA as a 
significant proportion of most genomic DNA is repetitive sequence* 

The advantage of providing a known sequence adjacent to probe sequence allows one to design 
libraries of probes, where all the probes in a library have the same melting teraperatore: This is 
advantageous as hybridisation of the entire library caa be performed-szmultaneously at a single 
temperature whilst retaining the stringency of hybridisation. 

Consider a large DNA fragment such as a mitochondrial genome or a cosmid or a microbial 
genome; To perform steps 1 to 4 of the method described above, such a large molecule can be 
cleaved wth a frequently cutting restriction ensyme to generate fragments of the order of a few 
hxxndred bases in length. If a restriction endonuclease like SauSAl is used fragments with a 
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known sriclcy end are left, to wiiich double stranded adaptors can be ligated. These adaptors will 
bear a known primer sequence, and a sticky end complementary to that produced by the 
restriction endonuclease to permit ligation. A combined restriction and ligation protocol as 
described above is appropriate. 

The majority of properly restricted fragments as a result bear an adaptor at each of dieir termini. 
This permits amplification of the adaptored restriction iBragments ax this stage if that is desired. 
After adaptoring and any non-selective amplification and noixnalisatioiv the nucleic acids can 
be differentially amplified to generate specific subsets of the starting population. The method of 
differential amplification preferably comprises the following steps: 

1. Dividing the adaptored population of restriction fragments into separate wells- If, for 
example, primers with an overlap of a single base are used then the adaptored fragments vvould 
be divided into 10 or 16 wells. 

2. Adding to each well one type of biotinylated primer of a predetermined set. The primer 
bears a sequence complementary to thai provided by ihe adaptor and restriction site. The primer 
additionally bears an overlap of a predetermined number of bases beyond the known sequence 
into the unknown sequence immediately adjacent to the restriction site. Primers with different 
overlaps are added to different well. Four primers are need if a ] base overlap is used. If 16 
wells are used each of the 4 primers are added to 4 wells. 

3 . Denaturing the amplified fragment population that was subdivided into each well by raising 
the temperature. The temperature is then reduced to permit the primer sequences to anneal. 
Primers preferably have equalised melting temperatures so that conditions for use of all primers 
are the same. 

4. Adding thermostable polymerase and nucleotides to extend annealed primers. 

5- Capturing the biotinylated strand extension products jSrom (4) onto a solid phase substrate 
derivitised wich avidin. This may be effected through the addition of avidinated beads. These 
may optionally be magnetic beads. 

6- Melting off the non-biotinylated complementary strand and washing this away. This leaves 
a single stranded copy of the selected fragments immobilised on the solid phase support. 
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7. To each of the separate pools is added one of the same set of primers as used in step (2) but 
not biotinvlaxed, such that each pool receives a differeni combination, of primers from this step 
and step (2)/ The primers should anneal to the single stranded capture molecules from (6), If 16 
pools are used, lo each is added one of tiae same 4 primers, but not biotinylated such that each 
of the 16 pools carries one of the possible different combinations of pairs of the 4 primers, 
8/ Extending the primed captured strands with polymerase and nucleotide triph 
9. Denaturing the free strand from the captured strand by raising the temperature. The 
'selected* free strand is thus released into solution. The liquid phase can be transferred to fresh 
reaction vessel or the solid phase suppon bearing the captured strands from (5) can be removed. 
This is very easy if the support -used are magnetic beads as these can be removed by 
electromagnetic attraction to a probe. 

The isolated free strands from (9) are thus isolated. At this stage the selected strands can be 
captured onto a solid phase suppon or amplified or the process of differential amplification can 
be repealed on the isolated subsets genemted to further sub-son these populations. This would 
be effected by using primers which overlap further into the unknown sequence adjacent to the 
known sequence of the adapter and the selected fragment. The sorted fragment could equally be 
cloned into a biological vector at this stage if desired. 

Generating a captured library is advantageous in that it facilitates easy manipulation of the 
library of fragments- Such manipulations include copying, amplification and probing of the 
library for particular sequences- A captured library dispenses with any requirement for 
biological cloning vectors to maintain the library as such a library can be readily copied using 
polymerases and nx^cleotide triphosphates. The c^rured library can be readily washed and can 
very easily be stored in a refrigerated environment. 

It should be noted in die example of primers that ov^lap by a single base, that die amplification 
products from the well containing a primer terminated by A followed by the primer terminated 
by G gives Che complement of the well where G is followed by A. It might therefore be 
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desirable to pool the reactions of where the same pair of primers are present but U5ed in a 
different order to ensure that both strands of each DNA molecule are present and captured on 
the solid phase supporL This would thus give 10 different pools. This is a couveziient ntimber as 
one can reduce the complexity of a library by one order of magnitude witih four primers. Each 
soned library of fragments can be jfinther sub-soned to an arbitrary degree. 

An aliemadve embodiment of this method uses primers already immobilised on a solid phase 
suppon, preferably covalently linked to the suppon instead of biorinylated primers in step (2) of 
the differential amplification process. Such solid phase supports can be magnetic beads, as 
described in EP-A-0 091 453 and EP-A-0 106 873. or die suppon could be polymer beads. 
PCT/GB92/02394 describes a solid phase polymer support in a micro-column v^rhere the solid 
phase suppon are beads of silica geL The beads arc retained between two frits in the column 
tihirough which solvents and reagents can flow. Such apparams is also applicable with this 
invention. 

One can clearly repeat the sorring process starting from a captured library that has been 
previously sorted. 

One can also clearly use just 10 wells to generate sorted populations as all of the sequence 
information in a series of 16 wells will be present if just the 10 different pairs of primer 
combinations are used. 

It should also be clear that labels can be introduced into sorted molecules by the primers used 
as part of the sorting process. Methods of introducing labels into primer oligonucleotides are 
well known in the arc. Bioiin has been discussed above, but many others are applicable. 

One can also use probes which overlap beyond the provided adaptor sequence to any extent. It 
becomes more difficult, however, to ensure the stringency of hybridisation as the number of 
bases extending into the unknown sequence from tiie adaptor is increased. 



wo 99/02725 - FCT/GB9S/02043 

■■ -lo- 

To effect higher degrees of soning one can either sort a sorted library with a set of four primers 
that overlap beyond the known lenninal sequences by a single base or one can use primers vyith 
a longer sequence overlap- To sort an adaptored population of nucleic acid fragments using 
primers with a 2 base overlap beyond the adaptor sequence, the adaptored population of 
restriction fragments is sub-divided into 256 wells. In each well is one of 16 biotinylaied 
primers which b^ar a sequence complementary to that provided by the adaptor and restriction 
site- The primers additionally bear an overlap of 2 bases beyond the known sequence into the 
unknown sequence immediately adjacent to the restriction site. The amplified fragment 
population subdivided into each well is denatured by raising the temperature and cooled 
allowing the primer sequences to armeaL Primers, again, preferably have equalised melting 
temperatures so that condiuons for use of all primers is the same. Thermostable polymerase and 
nucleotides are added to extend annealed primers. Blotinylated fra^ents are captured onto a 
solid phase substrate via avidiu and the complementary strand is melred off and washed away. 
To each of the 256 pools is added one of the same 16 primers, but not biotinylated such that 
each of the 256 pools carries one of the possible different combinations of pairs of the 16 
primers. Again, AT followed by GC gives the complement of the reaction of GC followed by 
AT so it might be desirable to pool these pairs to give a total of 1 3 6 pools. For an overlap of n 
bases, one can disiingmsh distinct sequences. If both termini of a molecule are used to select 
fragments then one can distinguish fragments into (4" x(4"-i-l)/2) distinct sets, since the 
orientation of each fragment is unknown- 
Sorting a library resolves fragments from a large, complex populaiiGn into defined sets whose 
size will be statistically regular and deterrninable as long as the size of the parent library is 
known, even if only approximately. The composition of the sorted library will be less complex 
than that of the parent library. This allows for useful manipuialions of a large library without 
loss of information as all the sequences present in the starting library should be present in one 
of the sub libraries as long as long as all of the possible sub-libraries are generated. This 
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method offers greater ease of manipulaiion of complex nucleic acid libraries and greater 



To put this invention into prEciise requires the construction of probe oligoBUcleotides (ONs), 
Precise control over hybridisation conditions will be required to ensure clean results in 
differential amplification. 

Details and reviews on the construction of labelled and* modified ONs are available in 
numerous up-to-date texts, see references 1 to 6 below, A brief discussion of preferred design 
possibilities is given below. 

There are major differences between the stability of short oligonucleotide duplexes containing 
all Watson-Crick base pairs. For example, duplexes comprising only adenine and thymine are 
unstable relative to duplexes of guanine and cytosine only. These differences in stability can 
present problems when trying to hybridise mixtures of short oligonucleotides to a target RNA. 
Low temperatures are needed to hybridise A-T rich sequences but at these temperatures G-C 
rich sequences will hybridise to sequences that are not fiilly complementary. This means that 
some mismatches may occur, and specificity can be lost for the G-C rich sequences. At higher 
temperatures G-C rich sequences -will hybridise specifically but A-T rich sequences will not 
^hybridise-'^'-;;: 

It is desirable tihat probes within a library behave in a similar manner, i.e, they should have 
similar melting temperatures and preferably also binding kinetics. In order to normalise these 
effects, modifications can be made to nucleic acids. Modifications fall into three broad 
categories: base modificaiions, backbone modifications and sugar modifications. 



precision of nianipulation than cloning into biological vectors. 



Base modifications 
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Numerous modifications can be made to tiie standard 'Waisou-Crick bases. The following axe 
examples of modifications that should nonnalise base pairing energies to some extent but they 
are not limiting: 

•The adenine analogue 2,6-dianiinopurine forms three hydrogen bonds to thymine rather 
than two and therefore fonns more stable base pairs. 

*The chymine analogue 5-propynyl dU forms more stable base pairs with adenine. 

*The guanine analogue hypoxanthine forois two hydrogen bonds with cytosine rather than 
three and therefore forms less stable base pairs. 

These and other possible modifications should make it possible to compress the temperature 
range at which short oligonucleotides can hybridise specifically to their complementary" • 
sequences. 

Backbone modifications 

Nucleotides may be readily modified in the phosphate moiety. Under certain conditions, such 
as low salt concentration, analogues such as methylphosphonates, triesters and 
phosphoramidates have been shown to increase duplex stability. Such modifications may also 
have increased nuclease resistance. Further phosphate modifications include phophodithirates 
and boranophosphates, each of which increase the stability of ONs. 

Isosteric replacement of phosphorus by sulphur gives nuclease resistant ONs (reference 7). 
Replacement by carbon at either phosphorus or linking oxygen is also a further possibility. 

Sugar modificaiions 

Various modifications to the 2" position in the sugar moiety may be made (references 12 and 
13)/ The sugar may be replaced by a different sugar such as hexose or the entire sugar 
phosphate backbone can be entirely jreplaced by a novel structure such as in peptide nucleic 
acids (PNA). For a discussion see reference 8. PNA may be the ideal choice as it forms 
duplexes of the highest thermal stability of any analogues so far discovered 
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Ariiftcial mismatches 

One major soiirce of error in hybridisation reactions is the stringency of hybridisation of the 
primers to the target sequence and to the unknown bases beyond. If the primers designed for a 
target bear single ardficially introduced mismatches the discrimination of the system is much 
higher (Zhen Guo er a/,. Nature Biotechnology 15, 331-335, April 1997); Additional 
mismatches are not tolerated to the same extent that a single mismatch would be when a fully 
complementary primer is used. Thus this can be exploited in the method disclosed above. If the 
probe used to extends beyond the provided sequence by I base, an artificial misraatch, 1 helical 
turn away from the probe base destabilises the double helix to a considerable degree if there is a 
second mismatch at the probe site. 

Details' on effects of hybridisation conditions for nucleic acid probes can be found in references 

■9 to ii; , . 

Mass labels for use in the present invention are discslosed in patent application 
PCT/GB98/00127, Farther labels for use in the present invention are discussed in the UK 
applications of Page White & Farrer file numbers 87^^ 87821, 87900- 



• 
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Claims: 

1 . A method for categorising nucleic acid, which method comprises producing a nucleic 

acid population by action of an endonuclease on double-stranded nucleic acid, such that each 
nucleic acid in the nucleic acid population has a double-stranded pordoti, contacting the nucleic 
acid population with one or more oligonucleotide 5equence$, and isolating nucleic acid which 
correctly hybridises to an oligonucleotide sequence by capturing the oligonucleotide sequence 
on a solid phase, wherein each oligonucleotide sequence has a pre-detetrnined recognition 
sequence, the nucleic acid being categorised by its ability to correctly hybridise to 
oligonucleotide sequences having the recognition sequence^ the recognition sequence being 
situated such that it recognises a sequence in the double-stranded portion of the nucldc acid, one 
or more different recognition sequences being represented in die oligonucleotide sequences. 

2- A method according to claini I, wherein the endonuclease is selected such that each 

nucleic acid in the nucleic acid population has a sticky end of a known common length extendin^g 
from a tenninal of its double-stranded portion. 

3. A method according to claim 1^ wherein the endonuclease is selected such that each 
sticky end of each nucleic acid in the nucleic acid population has the same known base sequence, 

4. A method according to claim 3, wherein prior to contacting the nucleic acid 
population with the oligonucleotide sequences, the nucleic acid population is contacted with an 
adaptor to ligate the adaptor to a temiinal of each nucleic acid in the nucleic acid population, 
wherein the adaptor comprises a double-stranded primer portion having a known base sequence, 
and a single-stranded portion complementary to the known sticky end of the nucleic acids in the 
nucleic acid popxilation- 

5. A method according to claim 4^ whereiq each oligonucleotide sequence comprises 
a first sequence, a second sequence attached to the first sequence and a third sequence attached 
to the second sequence, in which the first sequence is complementary to the sequence of the 
primer portion of the adaptor, the second sequence is complementary to the kno wn sticky anLd 
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of the nucleic acids in ±e nucleic acid population, and the third sequence comprises the pre- 
determined recognition sequence. 

6. A method according to claim 2, wherein the endonucleasc is selected such that the 
sticky ends of the nucleic acids in the nucleic acid population have apiurality of diifferentbase 
sequences. 

7. A method according to claim 6/ wherein prior to contacting the nucleic acid 
population with the oligonucleotide sequences, the nucleic acid population is contacted with an 
array of adaptors to ligate an adaptor to a tenninal of the nucleic acids in the nucleic acid 
population, wherein each adaptor comprises a double-stranded prim^ portion having a known 
base sequence, and a singje-stranded portion of the same length as the sticky ends of the nucleic 
acids in the nucleic acid population^ all of the possible base sequences of the single-stranded 
portion of the adaptor being represented in the array of adaptors, 

8- A method according to claim 7, wherein each oligonucleotide sequence comprises 

a first sequence, a second sequence attached to the first sequence and a tibuxd sequence attached 
to the second sequence^ in which the first sequence is complementary to the sequence of the 
primer portion of the adaptors, the second sequence is of the same length as the sticky ends of 
the nucleic acids in the nucleic acid population, and the third sequence comprises the pre- 
determined recognition sequence, and wherein in any one group of oligonucleotides having the 
same recognition sequence all of the possible base sequences of the second sequence are 
represented, 

5 v A method according to claim 5 or claim 8, wherein the recognition, sequence consists 

of one base, 

10. Amethodac'cording to claims or claim wherein the recognition sequence consists of two 
or more bases. 

11. A method according to any of claims 5 and 8-10, wherein in any one group of 
oligonucleotides having tihe same recognition sequence the third sequence consists of the 
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recognition sequence and a pre-detennined number of bases situated between the second 
sequence and the recognition sequence, all possible sequences of the pre-determined number of 
bases in the third sequence being represented in that group of oligonucleotides. 

12, A method according to any preceding claim, wherein the nucleic acid population is ampIijSed 
by PGR prior to reaction with the oligonucleotide sequences. 

1 3- A method according to any preceding claim, wherein those nucleic acids are isolated both 
terminals of which correctly hybridise to an oligonucleotide sequence. 

14. A method according to claim 13, wherein a fiist set of oligonucleotide sequences is contacted 
with the nucleic acid population in a first step by denaturing the nucleic acid population in the 
presence of the first set of sequences to produce a single-stranded nucleic acid population and 
allowing the single-stranded nucleic acid to hybridise to the first sequences, immobilising those 
nucleic acids which correctly hybridise to the first sequences^ extending the correctly hybridised 
oligonucleotide sequences along the single-stranded portion of the immobilised nucleic acid to 
form double-stranded nucleic acid, denaturing the double-stranded nucleic acid and removing 
non-immobilised species to isolate the resultiiig immobilised single-stranded nucleic acid, 
contacting the immobilised single-stranded nucleic acid with a second set of oligonucleotide 
sequences In a second step, extending the correctly hybridised oligonucleotide sequences along 
the immobilised single-stranded nucleic acid to form double-stranded nucleic acid, denaturing 
the double-stranded nucleic acid and isolating the resulting non-iramobilised single-stranded 
nucleic acid. 

15. A method according to claim 14, wherein the extended and isolated products of the fii^t step 
and/or the extended and isolated products ofthe second step are amplified by PCR. 

16. A method according to claim 14 or claim 15, wherein the correctly hybridised nucleic acids 
are immobilised by immobilising the oligonucleotide sequences. 
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17. A method according to claim 16, wherein each oligonucleotide in ttic first set of sequences 
carries a biotin residue such that prior to or after hybridising to the nucleic acid the sequeace is 
captured on an avidinated solid phase- 
is, A method according to claim 1 6, wherein each oligonucleotide in the first set of sequences 
is covalently attached to a solid support prior to contacting with the nucleic acid population, 

19. A method according to any of claims 14-1 S, wherein the recognition sequence of the first 
and second set of oligonucleotide sequences consists of one base and, prior to performing the 
first step, the nucleic acid population is sub-di-vided into 16 wells; each well containing 
oligonucleotides firom the first set of sequences having one of the four possible recogoition 
sequences, and wherein in the second step oligonucleotides fixim the second set of sequences are 
added to each well, such that all possible combinations of the identities of the first and second 
set of oligonucleotide sequences and their order of addition to the well are represented in the 16 

.weiisv, 

20* A method according to any of claims 14-1 8, wherein the recognition sequence of the fijst and 
second set of oligonucleotide sequences consists of two bases and, prior to perfoimiag the first 
step> the nucleic acid population is sub*divided into 256 wells, each well containing 
oligonucleotides from the first set of sequences having one of the 16 possible recogtdtion 
sequences, and wherein in the second reaction oligonucleotides from tbe second set of sequences 
are added to each well, such that all possible combinations of the identity of the first and second 
set of oligonucleotide sequences and their order of addition to the well are represented itx the 256 



21. A method according to claim 19, wherein the contents of each pair of wells to which 4.e same 
pair of oligonucleotide sequences were added but in a different order, are combined to give 10 
different wells. 

22- Amethod according to claim 20, whereinthecontents of each pair of wells to which the same 
pairof oHgonucleotide sequences were added but in a different order, are combined to give 136 
different wells. 



wells. 
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23. A method according to any preceding claim, wherein the oligonucleotide sequ^ces have 
equalised melting tempenitures. 

24» A method according to claim 23, wherein the melting temperatures are equah$ed by 
incorporating one or more analogues of natural nucleotides into the oligonucleotide sequences^ 
the analogues comprising base modifications, sugar modifications and/or backbone 
modifications, 

25: A method according to any preceding claim, wherein the endonuclease is selected such that 
it cuts the nucleic acid at a site within the recogaition dte of the endonucle^^^ 

26. A kit for categorising a nucleic acid, comprising one or more adaptors and one or more sets 
of oligonucleotide sequences, wherein the adaptors comprise nucleic acid having a double- 
stranded piimer portion of a known sequence and a single-stranded portion of a pre-deteimined 
lengthy either each single-strandedpordon of each nucleic acid in the adaptors having t^^ 
pre-deteimined sequence or all possible sequences of the single-stranded portion being 
represented in the adaptors/ and wherein each oligonucleotide sequence comprises a first 
sequence, a second sequence attached to the first sequence and a third sequence attached to the 
second sequence, in which the first sequence is complementary to the sequrace of the primer 
portion of the adaptor, the second sequence is the same sequence as the single-stranded portion 
of the adaptors or all possible second sequences of the same length as the single-stranded portion 
of the adaptors are represented within the set of oligonucleotides, and the third sequence 
comprises a pre-determined recognition sequence. 

27. A kit according to claim 26, wherein the recognition sequence consists of one base. 

28. A kit according to claim 26, wherein in the recognition sequence consists of two or more 

bases.' 

29. A kit according to any of claims 26-28, wherein in any one grouqj of oligonucleotides having 
the same recognition sequence, the third sequence consists of the recognition sequence and a pre- 



determined number of bases situated between the second sequence and the recognition sequence, 
all of the possible sequences of the pre-detennined number of bases in the third sequence being 
represented in that group of oligonucleotides. 

30. A kit according to any of claims 26-29, comprising two sets of oligonucleotide sequences, 
each of the oligonucleotides in one set being biotinylated; 

31. A kit according to any of claims 26-29, comprising two sets of oligonucleotide sequences, 
each of the oligonucleotides in one set being covalendy attached to a solid support. 

32- A Idt according to any of claims 26-31; additionally comprising an ^OT^ 

33 / A kit according to claim 32, wherein the endonuclease is selected such that when it is reacted 
with double-stranded nucleic acid, nucleic acids are produced each of which comprises a double- 
stranded portion, 

34. Akit according to claim 33, wherein the endonuclease is selected such that the nucleic acids 
produced have a sticky end of a known common length extending from a tennicxal of the double- 
stranded portion, and wherein each sticky end of each nucleic acid in the nucleic acid population 
has the same known base sequence. 

35. A kit according to claim 33, wherein the endonuclease is selected such that the nucleic acids 
produced have a sticky end of a known common length extending from a texzninal of the double- 
stranded portion, and wherein the sticky ends of the nucleic acids in the nucleic acid population 
exhibit a plurality of different base sequences, 

36. A kit according to any of claims 26-35, wherein the endonuclease is selected such that it cuts 
the nucleic acid at a site within de recognition site of the endonuclease/ ^ ^ ^^^^^^ 
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Claims: 

1, A m&chod for caiegorising nuclefic acid/ which lueihod comprises producicig a 
nucleic acid populatioa by acaoa of an endonuclease on double-stranded nucleic acid, such 
chat each nucleic acid in the nucleic acid populaiion has a double-stranded portion, coniaccing 
the nucleic acid population with one or more oligonucleotide sequences, and isolating: nucleic 
acid which correctly hybridises to an oligonucleotide sequence, wherein each oligonucleotide 
sequence has a pre-<ieKrnxined recognition sequence, die nucleic acid being categorised by its 
ability to cpirectly hybridise to oligonucleotide sequences having the recognition sequence, 
the recognition sequence being situated such that it recognises a sequence in the double- 
stranded portion of the nucleic acid, one or more different recognition sequences being 
represented in the oligonucleotide sequences. 

2, A method according to claim 1, wherein the endonuclease b selected such that 
each nucleic acid in the nucleic acid population has a sticky end of a known common length 
extending from a terminal of its double-stranded portion. 

3 , A method according to claim 1, wherein the- endonuclease is selected such that 

each sticky end of each nucleic acid in the nucleic acid population has the same known base 
sequence. 

4. A method according to claim 3, wherein prior to contacting the nucleic acid 

population with the oligonucleoude sequences/ ihe nucleic acid population is contacted with 
an adaptor to ligate the adaptor to a terminal of each nucleic acid in the nucleic acid 
population, wherein the adaptor comprises a double-stranded primer portion having a known 
base sequence; and a single-stranded ponion complementary to the known sticky end of the 
nucleic acids in the nucleic acid population. 



• 
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5 . A method according to claim 4, wherein each oligonucleotide sequence comprises 

a first sequence, a second sequence attached to liie first sequence and a third sequence 
attached to the second sequence, in which rhe first sequence is complemennary to the 
sequence of the primer ponion of the adaptor, the second sequence is corpplementaiy to the 
known sticky end of the nucleic acids in the nucleic acid population, and the third sequence 
comprises lie prendetermined recognition s^qnence.^ ^ ^^^^^^^^^^ ^^^^^^ ^ .^^ 

6: A meihod according to claim 2, wherein the endonuciease is selected such that the 

scicicy ends of the nucleic acids in the nucleic acid population have a plurality of differeni 
base sequences. 

7. A method according to claim 6, wherein prior lo contacting the nucleic acid 

population with the oligonucleoade sequences, the nucleic acid population is contacted with 
an array of adaptors to ligate an adaptor to a terminal of the nucleic acids in the nucleic acid 
population, wherein each adaptor comprises a double-stranded primer portion having a 
known base sequence, and a single*sxranded portion of the same Itv^gth as the sticiy ends of 
the nucleic acids in the nucleic acid population, all of the possible base sequences of the 
single-stranded porcion of the adaptor being represented in the array of adaptors. 

S. A method accordmg to claim 7/ wherein each oligonucie^^^^ 

a first sequence/ a second sequence attached to the first sequence and a third sequence 
attached to the second sequence, in which the first sequence is complementary to the 
sequence of the primer ponion of the adaptors, the second sequence is of the same length as 
the sticky ends of die nucleic acids in the nucleic acid population, and tlie third sequence 
comprises the pre-detennined recognition sequence, and wherein in any one group of 
oligonucleotides having the sanie recognition sequence all of the possible base sequences of 
the second sequence are represented. 
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9- A method according lo claim 5 or claim 8, wherein the recognition sequence 
consists of one base. 

10- A method according lo claim 5 or claim 8. wherein the recognition sequence 
consists of rwo or more bases. 

11. A method according to any of claims 5 and 8-10, wherein in any one group of 

oligonucleotides having the same recognition sequence the third sequence consists of the 
recognition sequence and a pre-detennined number of bases sioxated between the second 
sequence and the recognicion sequence, all possible sequences of the pre-decennined number 
of bases in the third sequence being represented in that group of oligonucleoudes . 

12- A method according to any preceding claim, wherein the nucleic acid population is 

amplified by PGR prior to reaction with the oligonucleotide sequences. 

13. A method according to a^oy preceding claim, wherein those nucleic acids a^^ 
isolated both terminals of which correctly hybridise to an oligonucleotide sequence, 

14. A method according to claim 13, wherein a first set of oligonucleotide sequences 
is contacted with the nucleic acid population in a first step by denaturing the nucleic acid 
population in the presence of the first set of $eq\iences to produce a single-stranded nucleic 
acid population and allowing the single-stranded nucleic acid to hybridise to the first 
sequences, itamobilising those nucleic acids which correctly hybridise to the first sequences, 
extending the correctly hybridised oligonucleotide sequences along the single-stranded 
ponion of die immobilised nucleic acid to fonn double-stranded nucleic acid, denaturing the 
double-stranded nucleic acid and removing nou-immobilised species to isolate the resulting 
immobilised single-stranded nucleic acid/contacung the immobilised single-stranded nucleic 
acid with a second set of oligonucleotide sequences in a second step, exrending the correcdy 
hybridised oligonucleotide sequences along lie immobilised single-scranded nucleic acid to 
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form doubie-slTOided nucleic acid, demturing the doable-stranded nucleic acid and isoUcbg 
the resulniig iK)iV'immobUised singles acid. 

15. A method according to claim 14, wherein the extended axid isolated products of 
the first step and/or the extended and isolated produces of the second step are amplified by 

PGR- 

16. A method according to claim 15 or claim 16, wherein the correctly hybridised 
nucleic acids are immobilised by immobilising the oligonucleotide sequences, 

17. A method according to claim 16, wherein each oligonucleotide in the first set of 
sequences carries a bioun residue; such that prior to or after hybridising to the nucleic acid the 
sequence is captured on an avidinated solid phase. 

18. A method according to claim 16, wherein each oligonucleotide in the first sec of 
sequences is covalently attached to a solid suppon prior to contacting with tSxe nucleic acid 
population. 

19. A method according to any of claims 14-18, wherein the recognition sequence of 
the first and second set of oligonucleotide sequences consists of one base and, prior to 
perfoimrng the first step, the nucleic acid population is sub-divided into 16 wells, each well 
containing oligonucleotides from the first set of sequences having one of the four possible 
recognition sequences, and wherein in the second step oligonucleotides from the second set of 
sequences arc added to each well, such that all possible combinations of the idenriries of the 
first and second set of oligonucleotide sequences and their order of addition to the well are 
represented in the 16 wells. 

20; A method according to any of claims 14-18, wherein Che recognition sequence of 
the first and second set of oligonucleotide sequences consists of two bases and, prior to 
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performing the first step* the nucleic acid popnlatioa is sufa-divlded into 256 wells, each well 
coacaintng oligomcleotides from die firsr set of sequences baving cue of the 16 possible 
Tecognitioa sequences, and wherein in the second reaction oligonucleotides from the second 
set of sequences are added to each well, such that all possible cotnbinations of die idencicy of 
die first and second set of oligonucleotide sequences and their order of addition to the well 
are represented in die 256 wells. 

21. A method according co claim 19, wherein die contents of each pair of wells to 
which the same pair of oligoxiucleotide sequences were added but in a different order, are 
combined to give 10 different wells. 

22. A method according to claim 20, wherein die contents of each pair of wells to 
which die same pair of oligomicleoude sequences were added but in a different order^ are 
combined to give 136 different wells. 

23. A method according CO any preceding claim, wherein the oligonucleotide 
sequences have equalised melting temperatures. 

24. A method according to claim 23 , wherein the melting temperatures are equalised 
by incorporating one or more analogues of natural nucleotides into the oligonucleotide 
sequences, die analogues comprising base modifications, sugar modifications and/or 
backbone modifications- . 

25. A method according to any preceding claim, wherein the endonuclease is selected 
snch that it cuts die nucleic acid at a site widiin die xecognidon s 

26. A kit for categorising a iiucleic acid, comprising one or more adaptors and one or 
more sets of oligonucleotide sequences, wherein die adaptors comprise nucleic acid having a 
double-siranded primer portion of a fcaown sequence and a siagle^stranded portion of a pre- 
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determined length, either each single-stranded portion of each nucleic acid in the adaptors 
having ±e same pre-determined sequence or all possible sequences of the single-stranded 
portion being represenced in the adaptors, and wherein each oligonucleotide sequence 
comprises a first sequence, a second sequence atcached to the first sequence and a diird 
sequence attached to the second sequence, in which the first, sequence is complementary to 
the sequence of the primer portion of the adaptor, the second sequence is die same sequence 
as the single-stranded portion of ±e adaptors or all possible second sequences of the same 
length as the single-stranded portion of the adaptors are represented within the set of 
oligonucleotides, and the diird sequence comprises a pre-detennined recognition sequence. 

27. A kit according to claim 26, wherein the recognidon sequence consists of one 
■ base, ■ 

28. A kit according to claim 26/ wherein in :he recognition sequence consists of two 
or more bases. 

29. A kit according to any of claims 26-28, wherein in any one group of 
oligonucleotides having the same recognition sequence, the ^^^^c^^^ sequence consists of the 
recognition sequence and a pre-detennined number of bases situated between ihe second 
sequence and the recognition sequence, all of the possible sequences of the pre-decennined 
number of bases in the third sequence being represented in that group of oligonucleotides. 

30/ A kit according to any of clairns 26-29, comprising two sets of oligonucleotide 
sequences, each of the oligonuckotides in oj^ 

3L A kit according to any of claims 26-29, comprising two sets of oligonucleotide 
sequences, each of the oligonucleotides in one set being covalently attached to a solid 

■support,;:' . 




wo 99/02725 PCT/GB98/02043 

.'21' 

32. A kit according to any of claims 26*31, addidonally comprising an endonuclease, 

33 . A kit according to claim 32, wherein die endonuclease is selected such that when it 
is reacted with doublc-$tranded nucleic acid, nucleic acids are produced each of which 
comprises a double-stranded portion. 

34. A kit according to claim 33, wherein the endonuclease is selected such that the 
nucleic acids produced have a sticky end of a known common length extending ficotn a 
teiminal of the double-stranded portion, and wherein each sticky end. of each nucleic acid in 
the nucleic acid populadon has the same known base sequence. 

35. A kit according to claim 33, wherein die endonuclease is seleaed such that the 
nucleic acids produced have a sticky end of a known common length extending from a 
terminal of the double-stranded portion, and wherein the sticky ends of the nucleic acids in 
the nucleic acid population exhibit a plurality of diff^^ 

36. A kit according to any of claims 26-35, wherein the endonuclease is selected such 
that it cuts the nucleic acid at a sire within the recognition site of the endonuclease. 
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□ the claims, Nos.: 

□ the drawings, sheets: 

3. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

4. Additional observations, if necessary: 
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V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 

1. Statement 



Novelty (N) 


Yes: 


Claims 


1-25,29-31,35,36 




No: 


Claims 


26-28,32-34 


Inventive step (IS) 


Yes: 


Claims 


none 




No: 


Claims 


1-36 


Industrial applicability (lA) 


Yes: 


Claims 


1-36 




No: 


Claims 


none 



2. Citations and explanations 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 
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Section V: 

The following documents are referred to in this communication: 

D1: NUCLEIC ACIDS RESEARCH (1. May 1997) 25 (9) pages 1854-1858 
D2: WO 94 01582 A (cited in the application) 

D1 describes a method for the amplification of specific DMA fragments including 
the ligation of adaptor molecules. The double stranded adaptors contain a 
universal primer sequence (M13 sequence), 5 nucleotides complementary to the 
protruding sequence produced by the restriction endonuclease (Bell) and 4 
nucleotides which reside immediately adjacent to the restriction recognition 
sequence in the target molecule (p 1854 col 2 para 2; Fig 1). End-specific 
adaptors with pre-determined indexing sequences are disclosed (Fig 2A) and also 
the use of combinatorial adaptors (p 1857 col 1 para 1; Fig 2B) that contain a 
degenerate mixture of indexing sequences made in one oligodeoxynucleotide 
synthesis. Applications of this method are proposed, namely indexing of DMA 
populations and physical mapping and sequencing of whole genomes or sections 
of complex genomes (p 1857 col 2 para 3- p 1858 col 1 para 1). 

D2 deals with a process for categorizing of nucleotide sequence populations 
(Title). D2 is cited and discussed in the description of the present application 
(appL: p 1 para 4-p 2 para 2). The method of D2 includes the digestion of nucleic 
acid with an endonuclease, ligation of adaptors containing sequences that are 
complementary to the extending cleavage derived sequences (p 18 1 1 1-15) and 
separating adaptored products (p 8 para 4 - p 9 para 2). An embodiment of the 
method includes biontinylated adaptors which allow the capturing of the 
oligonucleotide on a solid phase (p 18 I 24- p 19 I 3; Example 1: p 37 I 5-p 38 I 
16). Moreover the adaptor can comprise a known and selected sequence such 
that specifically adaptored molecules can be amplified by PCR using a primer 
complementary to the core sequence (p 19 I 3-7; Fig.1). 

1. Novelty 

1.1 Claim 1 and the dependent claims 2-25 are novel (Art 33(2)PCT), because in 
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none of the available documents all its technical features are disclosed: 

- a method for categorizing nucleic acid comprising 

- producing a nucleic acid population by action of an endonuclease on nucleic acid 

- contacting the nucleic acid population with an oligonucleotide that contains a 
recognition sequence which enables a sequence specific hybridization with a 
double stranded part of some nucleic acids comprised in the population 

- isolating nucleic acid by capturing the oligonucleotide on a solid phase 

1 .2 Claim 26 relates to a kit containing the components to carry out the method 
disclosed in the application. It is not novel (Art 33(2) PCT) because D2 discloses a 
kit comprising adaptors, endonuclease e.g. Fokl and oligonucleotide sequences 
used as PGR primers (D2: p 23 para 2; claims 29-34), The PGR primers comprise 
in some embodiments a sequence complementary to the core sequence of the 
adaptors (" first sequence") and may preferably extend by one or more specific 
extra bases into the adaptored fragment ("third sequence") (p 28 I 25- p 29 I 3). 
This implies, that the oligonucleotide sequences also contain the sequence of the 
single stranded portion of the adaptor ("second sequence"). Thus, these primers 
contain all the technical features of the oligonucleotide sequences claimed in 
claim 26. The same applies to claim 27-28, 32-34. 

1 .3 Claims 29-31, 35 and 36 which are dependent on claim 26 are novel, because 
they contain features that are not disclosed in D2 (cf. point 1.2 above). 

2 Inventive Step : 

2.1 Claim 1 does not meet the requirements of Art 33(3)PCT for the following 
reasons: 

D2, which is considered to represent the closest prior art, is distinguished from the 
subject-matter of claim 1 in that the oligonucleotide adaptors that correctly 
hybridize to the protruding single stranded portion of the nucleic acid are used to 
categorize the nucleic acid whereas according to claim 1 the oligonucleotide 
recognizes a sequence in the double stranded portion of the nucleic acid. 

The problem to be solved over D2 can thus be regarded as how to provide a 
method for categorizing nucleic acid on a wider basis than the sequence of the 
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sticky ends (appl. p 2 para 2). 

In order to solve the problem a skilled person would turn to D1, because D1 deals 
with indexing of DNA applicable for the accession and physical mapping of 
genomic DNA (abstract; p 1856 col 1 para 2; p 1857 col 2 para 3), 
"Indexing" is understood to be a specific embodiment of "categorizing" as 
explained in D2 (p1 para 1; p 14 I 3-8; p 53 I 1-13). Indexing embraces the 
categorizing and the positioning of a marker (adaptor) at a predetermined site in a 
sequence. 

The indexing sequence (D1: Fig 1) contained in the oligonucleotide adaptor 
hybridizes with a double stranded portion of the target nucleic acid. Thus, the 
adaptor of D1 has a predetermined recognition sequence that recognises a 
sequence in the double-stranded portion of the nucleic acid. By reacting the 
adaptor to a population of double stranded nucleic acids, that has been digested 
by an endonuclease, the adaptor hybridizes to nucleic acid molecules containing 
the recognition sequence in their double stranded portion. Hence, an adaptor 
according to D1 (e.g. Fig. 1) shows all technical features of the oligonucleotide 
sequence according to claim 1. 

Thus, combining the general method of D2 with the adaptor oligonucleotide of D1 
renders claim 1 obvious for a skilled person (Art 33(3)PCT). 
The same applies to the dependent claims 2,3, 6-8, 10, 11-13 as they do not 
contain an inventive concept per se. 

2.2 Claim 4 relates to a method in which an oligonucleotide adaptor is ligated first to 
the nucleic acids and then another oligonucleotide is used to hybridize with its 
recognition sequence on the double stranded portion of the nucleic acid before 
isolating the nucleic acid which hybridizes correctly with the oligonucleotide 
sequence. 

In D2, which is considered to represent the closest prior art, adaptor 
oligonucleotides are described that contain a known and selected sequence such 
that nucleic acids linked to an adaptor can be amplified by PGR using a primer 
complementary to the core sequence (p 19 I 3-7). These adaptors appear to 
contain the same technical features as the adaptors of the present claim 4. 
D2 is distinguished from the subject-matter of claim 4 in that in D2 the nucleic 
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acids that correctly hybridize with the adaptor oligonucleotide are isolated 
whereas in the method according to claim 4 the nucleic acids are isolated that 
correctly hybridize to another oligonucleotide. Moreover, the adaptor disclosed in 
D2 hybridizes to a single stranded portion of the nucleic acid (see point 2 above). 
The problem to be solved over D2 can also be regarded as how to provide 
method for categorizing nucleic acid on a wider basis than the sequence of the 
sticky ends (appi, p 2 para 2). 

The method of D2 including the amplification of the adaptored nucleic acids by 
primers that hybridize with a complementary portion of the adaptor (D2: p 19 I 3-7) 
is suitable for categorizing nucleic acids on a wider basis than the sequence of the 
sticky ends. In D2 the general technique is disclosed for the isolation of adaptored 
nucleic acids (p 18 I 24- p 19 I 3 ; p 28 112-p 29 I 3). In addition the physical 
separation of the initial adaptored nucleic acids is discussed and it is stated, that it 
is not strictly necessary, if a PCR-type process using selected primers is 
employed (p 28 I 17-19). Moreover, subsets of sequences obtained by the method 
of D2 can themselves be immobilized by standard techniques for further analysis 
(p 32 I 26-p 33 I 2). 

Thus, a skilled person would be able to isolate the amplification products by 
common technology without being inventive (Art 33(3) PCT). The same applies to 
the dependent claims 5,9-25 as they do not contain an inventive concept per se. 

2.3 Although claims 29-31, 35 and 36 which are dependent on claim 26 contain new 
technical features they do not appear to be inventive as they appear to be simple 
modifications that can be achieved using common technology by a skilled person 
(Art 33(3)PCT). 

Section VII: 

- Contrary to the requirements of Rule 5.1(a)(ii) PCT, the relevant background art 
disclosed in the document D1 is not mentioned in the description, nor is this 
document identified therein. 

- In order for the application to be self-contained, patent application numbers 
should have been replaced by the corresponding publication numbers (e.g. p 1 
para 4; p 6 I 1). 
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Section V: 

The following documents are referred to in this communte^^ 

D1: NUCLEIC ACIDS RESEARCH (1 . May 1997) 25 (9) pages 1854-1858 
D2i WO 94 01582 A (cited in the application) 

D1 describes a method for the amplification of specific DNA fragments including 
the ligation of adaptor molecules. The double stranded adaptors contain a 
universal primer sequence (M1 3 sequence), 5 nucleotides complementary to the 
protruding sequence produced by the restriction endonuolease (edl) and 4 
nucleotides which reside immediately adjacent to the restriction recognition 
sequence in the target molecule (p 1854 col 2 para 2; fig 1). End-specific 
adaptors with pre-determined indexing sequences are disclosed (Fig 2A) and also 
the use of combinatorial adaptors (p 1 857 col l para 1 ; Fig 2B) that contain a 
degenerate mixture of indexing sequences made in one oligodeoxynucleotide 
synthesis. Applications of this method are proposed, namely indexing of DNA 
populations and physical mapping and sequencing of whole genomes or sections 
of complex genomes (p 1857 col 2 para 3- p 1858 col 1 paral). 

D2 deals with a process for categorizing of nucleotide sequence populations 
(Title). D2 Is cited and discussed in the descnption of the present 
(appl.: pi para 4-p 2 para 2), The method of D2 includes the digestion ofnucleic 
acid with an endonuciease, ligation of adaptors containing sequences that are 
complementary to theextending cleavage derived sequences (p 18 1 11-15) and 
separating adaptored products (p 8 para 4 - p 9 para 2). An embodiment of the 
method includes biontinylated adaptors which allow the capturing of the 
oligonucleotide on a solid phase (p i 8 1 24- p 19 1 3; Example 1 ; p 37 r^^^^^ 
16), Moreover the adaptor can comprise a known and selected sequence such 
that specificafly adaptored molecules can be amplified by PGR using a primer 
complementary to the core sequence (p 19 13-7; FIg.l). 

1. Novelty 

1.1 Claim 1 and the dependent claims 2-25 are novel (Art 33(2)PCT), because in 
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none of the available documents all its technical features are dfeclosed: 

- a method for categorizing nucleic acid comprising 

- producing a nucleic acid population by action of an endonuclease on nucleic acid 

- contacting the nucleic acid population with an oligonucleotide that contains a 
recognition sequence which enables a sequence specific hybridization with a 
double stranded part of some nucleic acids comprised in the population 

- Isolating nucleic acid by capturing the oligonucleotide on a solid phase 

1 .2 Claim 26 relates to a kit containing the components to carry out the method 
disclosed in the application. It is not novel (Art 33(2) PCT) because D2 discloses a 
kit comprising adaptors, endonuclease e.g, Fokl and oligonucleotide sequences 
used as PGR primers (D2: p 23 para 2; claims 29-34). The PCR primers comprise 
In some embodiments a sequence complementary to the core sequence of the 
adaptors (" first sequence") and may preferably extend by one or more specific 
extra bases Into the adaptored fragment ("third sequence") (p 28 1 25- p 29 1 3). 
This implies, that the oligonucleotide sequences also contain the sequence of the 
single stranded portion of the adaptor ("second sequence"). Thus, these primers 
contain all the technical features of the oligonucleotide sequences claimed in 
claim 26. The same applies to claim 27-28, 32-34. 

1.3 Claims 29-31, 35 and 36 which are dependent on claim 26 are novel, because 
they contain features that are not disclosed in D2 (cf. point 1.2 above). 

2 Inventive Step: 

2.1 Claimi does not meet the requirements of Art 33(3) PCT for the following 
.. reasons:- ; .. ^ 

D2, which is conslderedto represent the closest prior art, is distinguished from the 
subject-matter of claim 1 in that the oligonucleotide adaptors that correctly 
hybridize to the protruding single stranded portion of the nucleic acid are used to 
categorize the nucleic acid whereas according to claim t the o^^^ 
recognizes a sequence in the double stranded portion of the nucleic aci 

The problem to be solved over D2 can thus be regarded as how to provide a 
method for categorizing nucleic acid on a wider basis than the sequence of the 
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stfcky ends (appL p 2 para 2). 

In order to solve the problem a skilled person would turn to D1, because D1 deals 
with Indexing of DNA applicable for the accession and physical mapping of 
genomic DNA (abstract; p i 856 col 1 para 2; p 1857 col 2 para 3). 
"Inde^dng" is understood to be a specific embodiment of "categorizing" as 
explained In D2 (pi para 1; p 141 3-8; p S3 1 1-13). Indexing embraces the 
categorizing and the positioning of a marker (adaptor) at a predetermined site In a 
sequence. 

The Indexing sequence (D1: Fig 1) contained In the oligonucleotide adaptor 
hybridizes with a double stranded portion of the target nucleic 
adaptor of D1 has a predetermined recognition sequence that recognises a 
sequence in the double-stranded portion of the nucleic acid By reacting the 
adaptor to a population of double stranded nuclalc acids, that has been digested 
by an endonuolease, the adaptor hybridtees to nucleic acid molecules containing 
the recognition sequence In their double stranded portion. Hence, an adaptor 
according to D1 (e.g. Fig. 1 ) shows alt technical features of the oligonucleotide 
sequence according to claim 1. 

Thus, combining the general method of D2 with the adaptor oligonucleotide of D1 
renders claim 1 obvious for a skilled person (Art 33(3)PCT), 
The same applies to the dependent claims 2,3, 6-8, 10, 11-13 as they do not 
contain an inventive concept per se. 

2.2 Claim 4 relates to a method in which an oligonucleotide adaptor Is ((gated first to 
the nucleic acids and then another oligonucleotide is used to hybridize with its 
recognition sequence on the double stranded portion of the nucleic acid before 
isolating the nucleic acid which hybridizes correctly with the oligonucleotide 
sequence. 

In D2, which is considered to represent the closest prior art, adaptor 
olfgonudeotldes are described that contain a known and selected sequence such 
that nucleic acids linked to an adaptor can be amplified by PGR using a primer 
complementary to the core sequence (p 1 9 1 3-7). These adaptors appear to 
contain the same technical features as the adaptors of the present claim 4, 
D2 is distinguished from the subject-matter of claim 4 in that In D2 the nucleic 
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acids that correctly hybridize with the adaptor oligonucleotide are Isolated 
whereas in the method according to claim 4 the nucleic acids are Isolated that 
correctly hybridize to another oligonucleotide. Moreover, the adaptor disclosed in 
D2 hybridizes to a single stranded portion of the nucleic acid (see point 2 above)/ 
The problem to be solved over D2 can also be regarded as how to provide 
method for categorizing nucleic acid on a wider basis than the sequence of the 
sticky ends (appL p 2 para 2). 

The method of D2 including the amplification of the adaptored nucleic acids by 
primers that hybridize with a complementary portion of the adaptor (D2: p 19 I 3-7) 
is suitable for categorizing nucleic acids on a wider basis than the sequence of the 
sticlcy ends, tn D2 the general technique is disclosed for the isolation of adaptored 
nucleic acids (p 18 1 24- p 19 13 ; p 28 1 12-p 29 1 3). In addition the physical 
separation of the initial adaptored nucleic acids is discussed and It is stated, that It 
is not strictly necessary, If a PCR-type process using selected primers is 
employed (p 28 1 17-19), Moreover, subsets of sequences obtained by the method 
of D2 can themselves be immobilized by standard techniques for further analysis 
(p 32 I 26-^) 33 I 2). 

Thus, a skilled person would be able to isolate the amplification products by 
common technology without being Inventive (Art 33(3)PCT). The same applies to 
the dependent claims 5,9-25 as they do not contain an Inventive concept per se. 

2.3 Although clainne 29-31, 35 and 36 which are dependent on claim 26 contain new 
technical features they do not appear to be Inventive as they appear to be simple 
m edifications that can be ach ieved using common technology by a skilled person 
(Art 33(3)PCT). 

Section V)l: 

- Contrary to the requirements of Rule 5.1(a)(ll) PCT;the relevant background art 
disclosed in the document DI is not mentioned in the description, nor Is thls 
document Identified therein, 

- In order for the application to be self-contained, patent application numbers 
should have been replaced by the corresponding publication numbers (e*g. pi 
para4; p6 i 1). 
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